ENZYMES HAVING ALPHA-GALACTOSIDASE ACTIVITY

AND METHODS OF USE THEREOF

RELATED APPLICATIONS

This application claims the benefit of U.S. Application Serial No. 09/619,032,
filed in July 2000, now pending; which is a divisional of U.S. Application Serial No.
09/407,806, filed September 20, 1999; which is a divisional of U.S. Application Serial No.
08/613,220, filed March 8, 1996, now U.S. Patent 5,958,751, all of which are herein
incorporated by reference in their entirety.

FIELD OF THE INVENTION 

This invention relates generally to enzymes, polynucleotides encoding the enzymes, the
use of such polynucleotides and polypeptides, and more specifically to enzymes having
alpha-galactosidase activity.

BACKGROUND 

Alpha galactosidase is an enzyme which hydrolyzes the non-reducing terminal alpha 1

gassy foods. 

The publications discussed herein are provided solely for their disclosure prior to the
invention is not entitled to antedate such disclosure by virtue of prior invention.

SUMMARY OF THE INVENTION

polypeptides having alpha-galactosidase activity.

thereto, and the sequences complementary thereto,
such sequences. 

of the invention is an isolated nucleic acid encoding a polypeptide or a

Another aspect of the invention
set forth in SEQ ID NO.: 4, and sequences

substantially identical thereto.

at least 10 consecutive amino acids of a sequence as set forth in SEQ ID NO

substantially identical thereto.

set forth in SEQ ID NO.: 4, and sequences substantially identical thereto.

to a polypeptide having a sequence as set forth in SEQ ID
identical thereto.

one of the polypeptides of SEQ ID NO.: 4,

set forth in SEQ ID NO.: 4, and sequences substantially identical thereto. The method

expression of the nucleic acid. 

set forth in SEQ ID NO.: 3, sequences substantially identical
nucleic acid having a sequence as set forth in SEQ ID NO.: 3, fragments comprising at

thereto, sequences complementary to the sequences of SEQ ID NO.: 1,

sequence, or adding one or more nucleotides to the sequence. 

Another aspect of the invention is a computer readable medium having stored thereon a

polypeptide ^--« ta SBQlDHa:<-^-— 

thereto. 

storage device wherein the data storage device has stored thereon a sequence as set forth in SEQ
set forth in SEQ ID NO.: 4, and sequences substantially identical thereto.

sequence wherein the first sequence is a nucleic acid having a sequence as set forth in SEQ ID
NO.: 3, and sequences substantially identical thereto, or a polypeptide code of SEQ ID NO.: 4,
and sequences



program. 



Another aspect of the invention is a method for identifying a feature in a sequence as set 
forth in SEQ ID NO.: 3, and sequences substantially identical thereto, or a polypeptide having a 

reading the sequence through the use of a computer program which identifies features in
sequences; and identifying features in the sequence with the computer program. 

Another aspect of the invention

polypeptide and substrate, thereby identifying

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings are illustrative of embodiments of the invention and are not
meant to limit the scope of the invention as encompassed by the claims.
Figure 1 is a block diagram of a computer system. 

Figure 2 is a flow diagram illustrating one embodiment of a process for comparing a new
nucleotide or protein sequence with a database of sequences in order to determine the
homology levels between the new sequence and the sequences in the database.

Figure 3 is a flow diagram illustrating one embodiment of a process in a computer
for determining whether two sequences are homologous.

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300

invention. Sequencing was performed using a 378 automated DNA sequencer (Applied
Biosystems, Inc.).

DETAILED DESCRIPTION OF THE INVENTION 

The polynucleotides of the

having α-galactosidase activity.

material, natural or synthetic in origin.

nucleic acid sequence which is transcribed and translated into a polypeptide or
protein when placed under the control of appropriate regulatory sequences.

The term "gene" means the segment of DNA involved in producing a polypeptide chain;
where applicable, intervening sequences (introns) between

naturally occurring or synthetic molecules.

degrees at several sites in a given polypeptide. Also a given polypeptide

nation, -^T^^aP!---*--* 

pvroglutan^rntW 

^oxylatron.—n.rnetltylatio, 

processing, phosphorylation, prenylation, racemization

(1993); Posttranslational Covalent Modification of Proteins, B.C. Johnson, Ed., Academic Press,

New York, pp. 1-12 (1983)).

As used herein, the term

vector or composition is not part of its natural environment.



purified* e,ectrophore«cho m oge„e,<y. ^"^» ^fiednucierc acids of 

le as.l0-10 fold. How r&omotherseque „ce S inaUbra I yorother 

four or five orders of magnitude. 

As used herein, the term "recombinant" means that the nucleic acid is adjacent to a

H. M. Geysen et al, Proc. Natl. Acad. Sci.,

r sfora ~i s-t;^- 

such a process step, i.e., inverting <m b -i„KUT7MOr nentide 



techniques. 






the art and will be apparent to the ordinarily skilled artisan.

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme

purpose of isolating DNA fragments for plasmid construction, typically 5 to
Incubation times of about 1 hour at 37°C are ordinarily used, but may vary
desired fragment.

synthetic oligonucleotides have no 5 phosphate and th nce 0 f a kinase. A 

:L"^r.r;..— — ;rr- 

150-200 residues. In some embodiments, the sequences are substantially

length of the coding regions. 

-r-sssssss 

not the active site of the molecule, an P substitutes one 
functional activity of the

3 *»**>• FragmentswhKhhav edtfferent decule , such . a ,ow activity 

higher activity.

can be defined by, for

it is present at low concentrations.

50% formamide at about 37°C to 42°C.
conditions in about 35% to 25% formamide

could occur under reduced stringency conditions as described above,
of 35°C. The temperature range corresponding to a particular level

of stringency can be further narrowed by calculating

nucleic acid of interest and adjusting the temperature accordingly.

ranges and conditions are well known in the art.

7 taction in organic solvents, operate a. « res) , extrem e salinity 

1 ext.ernetenrperatnres^exarnple.hrgbtenrpe^ 

ill levels (for example, high sahnity and low sahnrty), and ca*lyz 

many diverse reactions nnrelated.o then physio S peroxidases can also 

— ° n :::i rr— i*--^— - 
30 ^r^^.-— --^-""^^ 
of enzymes. Whereas the
transformations normally requires the ^j^^ selected biocatalysts and reaction 

i f n of derivatives from a single starting 

componnd.Thereactionseqnencersrepeat^ d s « approaches, does 

compounds canbeprod^^^ 
m olecules, S eePCT/US94/09174,hereininco^ 






assembled non-stochastically. 

of a high level of homology between

achieved.

I asse.nb.eda.e—edtobe-s— for yP ^ ^ ^ 

- onnaingMoefcstooeeonplea^ 

treated with an enzyme, such as a ligase (e.g., T4 DNA
the building pieces. 

„ *r^J2=r--— ------ 



nucleic acid building blocks that are to be

products are enzymes. The α-galactosidase of the present
accordance with the methods described herein.

molecules.






demarcationpointeanbeanareaofhomdogythat^ar y urths of the prog eni.or 
templates. 

products is greatly reduced. 
that the gene reassembly process is

ent,meme,hodpro^ ^^^^^^ 

examined systematically in smaller groups.

S e Xh an S tWean d s y stema,ieasweU, P — ^"^rf^.-, 

1 pm.nitormolecnles^e.nsta.tinvenuon^fo <^ ..^.a,..,*. 
1 comp^o.larsennm.roprosen ^ g ^ ed prefe ra bly comprise a 

13 i„stantgenereas S emblymven„on>epoge cules ha ving an ovemU assembly order that >s 

:= chosen by design. iua F ™ n wular species. 

511 3 than 10 1000 different progeny molecular v 

m than 10 3 to greater than iu 

»-.»--' 

plant). 

H nfthe step in which the building blocks 
mother exempl—thesyntheticna^ 
process (e.g., by mutagenesis) or in an in vivo process (e.g., by
optionally removed in an m ™o process (e.g., y ^ jn ^ „ s the 

— ^eanoeused.0— ^,n^ The invent ion also prov.es 
intronsmaybeintrodueedrmoaman-mdege enepalhwayo f Reinvention. 

■ rbaHsa—de.enepafcw^ 
Preferably.theartifrei^^^ 

— .enep— 
recornb-on-ano^^ 

the invention ean also serveasasnbstratefo ^ 
preferred—, tberec^^^ 

more) « 1 ntrodneed 1 n tt on(s,intbe m an-n,ade g ene. 
bul , dl „ g bloe k s,eacbo f wbicbpre f erablybas W o,, g atablee„d, 
overhangs.

may be a 3' overhang or a 5' overhang. Thus, a
nucleic acid building block may have a 3' overhang

; srs^rrsi 

h building block. 

(including every integer value in between).

generated that is serviceable for the invention, and these are
performed by the skilled artisan.

described herein, using one or more



cassettes. 






The approach of using recombination



recombination. 






In vivo shuffling of molecules is useful in providing variants and

utilizing the natural property of cells to recombine multimers. While recombination in vivo has

finally 3) the resolution of chiasma into discrete recombined molecules. The formation of the
chiasma requires the recognition of homologous sequences. 

In another embodiment, the invention includes a method for producing a hybrid
polynucleotide from at least a first polynucleotide and a second polynucleotide. The invention
can be used to produce a hybrid polynucleotide by introducing at least a first polynucleotide and
a second polynucleotide which share at least one region of partial sequence homology into a
suitable host cell. The regions of partial sequence homology promote processes which result in
sequence reorganization producing a hybrid polynucleotide. The term "hybrid polynucleotide",
as used herein, is any nucleotide sequence which results from the method of the present
invention and contains sequence from at least two original polynucleotide sequences. Such
hybrid polynucleotides can result from intermolecular recombination events which promote
sequence integration between DNA molecules. In addition, such hybrid polynucleotides can
result from intramolecular reductive reassortment processes which utilize repeated sequences to
alter a nucleotide sequence within a DNA molecule.
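
By way of illustration only, a region of partial sequence homology shared by two polynucleotides can be located computationally before the polynucleotides are introduced into a host cell. The following Python sketch is not part of the disclosed method. It assumes that the longest exact stretch common to both sequences is an adequate stand-in for a shared region of homology; the function name and the example sequences are hypothetical.

```python
from difflib import SequenceMatcher


def longest_shared_region(first, second):
    """Return (start_in_first, start_in_second, length) of the longest
    exact stretch common to both sequences."""
    # autojunk=False switches off difflib's heuristic that discards very
    # frequent elements in long inputs, which is unsuitable for a
    # four-letter nucleotide alphabet.
    matcher = SequenceMatcher(None, first, second, autojunk=False)
    match = matcher.find_longest_match(0, len(first), 0, len(second))
    return match.a, match.b, match.size


# Hypothetical example sequences, not taken from any SEQ ID NO.
first_polynucleotide = "ATGGCGTACGTTAGC"
second_polynucleotide = "TTTGCGTACGTAAA"

start_1, start_2, length = longest_shared_region(
    first_polynucleotide, second_polynucleotide
)
shared = first_polynucleotide[start_1:start_1 + length]
print(shared, start_1, start_2, length)
```

An exact-match search is the simplest possible choice. Real regions of partial homology tolerate mismatches, so an alignment-based comparison would ordinarily be substituted for it.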

The invention provides a means for generating hybrid polynucleotides which may encode
biologically active hybrid polypeptides (e.g., hybrid α-galactosidase). In one aspect, the original
polynucleotides encode biologically active polypeptides. The method of the invention produces
new hybrid polypeptides by utilizing cellular processes which integrate the sequence of the
original polynucleotides such that the resulting hybrid polynucleotide encodes a polypeptide
demonstrating activities derived from the original biologically active polypeptides. For example,
the original polynucleotides may encode a particular enzyme from different microorganisms. An
enzyme encoded by a first polynucleotide from one organism or variant may, for example,
function effectively under a particular environmental condition, e.g., high salinity. An enzyme
encoded by a second polynucleotide from a different organism or variant may function
effectively under a different environmental condition, such as extremely high temperatures. A
hybrid polynucleotide containing sequences from the first and second original polynucleotides
may encode an enzyme which exhibits characteristics of both enzymes encoded by the original
polynucleotides. Thus, the enzyme encoded by the hybrid polynucleotide may function
effectively under environmental conditions shared by each of the enzymes encoded by the first
and second polynucleotides, e.g., high salinity and extreme temperatures.

Enzymes encoded by the polynucleotides of the invention include, but are not limited to, 
hydrolases, such as α-galactosidases. A hybrid polypeptide resulting from the method of the
invention may exhibit specialized enzyme activity not displayed in the original enzymes. For
example, following recombination and/or reductive reassortment of polynucleotides encoding
hydrolase activities, the resulting hybrid polypeptide encoded by a hybrid polynucleotide can be
screened for specialized hydrolase activities obtained from each of the original enzymes, i.e., the
type of bond on which the hydrolase acts and the temperature at which the hydrolase functions.
Thus, for example, the hydrolase may be screened to ascertain those chemical functionalities
which distinguish the hybrid hydrolase from the original hydrolases, such as: (a) amide (peptide
bonds), i.e., proteases; (b) ester bonds, i.e., esterases and lipases; (c) acetals, i.e., glycosidases
and, for example, the temperature, pH or salt concentration at which the hybrid polypeptide
functions.

The original polynucleotides may be isolated from individual organisms
("isolates"), collections of organisms that have been grown in defined media ("enrichment
cultures"), or uncultivated organisms ("environmental samples"). The use of a
culture-independent approach to derive polynucleotides encoding novel bioactivities from
environmental samples is most preferable since it allows one to access untapped resources of
biodiversity.

"Environmental libraries" are generated from environmental samples and represent the 
collective genomes of naturally occurring organisms archived in cloning vectors that can be
propagated in suitable prokaryotic hosts. Because the cloned DNA is initially extracted directly
from environmental samples, the libraries are not limited to the small fraction of prokaryotes that
can be grown in pure culture. Additionally, a normalization of the environmental DNA present
in these samples could allow more equal representation of the DNA from all of the species
present in the original sample. This can dramatically increase the efficiency of finding
interesting genes from minor constituents of the sample which may be under-represented by
several orders of magnitude compared to the dominant species.
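
The effect of normalization on screening effort can be illustrated with a short calculation. The sketch below is an illustration under stated assumptions rather than part of the disclosure: it treats each screened clone as an independent random draw from the library, and the function name, the 95% confidence level, and the example fractions are all hypothetical.

```python
import math


def clones_needed(fraction, confidence=0.95):
    """Number of independently drawn clones needed so that at least one
    clone from a species making up `fraction` of the library is seen
    with probability `confidence`."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not 0 < confidence < 1:
        raise ValueError("confidence must be in (0, 1)")
    if fraction == 1:
        return 1
    # P(no hit in n draws) = (1 - fraction) ** n <= 1 - confidence
    return math.ceil(math.log(1 - confidence) / math.log(1 - fraction))


# Hypothetical library: a minor species under-represented by three
# orders of magnitude, versus the same species after normalization of
# a three-species sample to equal representation.
before = clones_needed(0.001)
after = clones_needed(1 / 3)
print(before, after)
```

Under these assumptions the number of clones to be screened falls by roughly the same factor by which the minor constituent was under-represented.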

For example, gene libraries generated from one or more uncultivated microorganisms are 
screened for an activity of interest. Potential pathways encoding bioactive molecules of interest 
are first captured in prokaryotic cells in the form of gene expression libraries. Polynucleotides 
encoding activities of interest are isolated from such libraries and introduced into a host cell. 
The host cell is grown under conditions which promote recombination and/or reductive 
reassortment creating potentially active biomolecules with novel or enhanced activities. 

The microorganisms from which the polynucleotide may be prepared include prokaryotic 
microorganisms, such as Eubacteria and Archaebacteria, and lower eukaryotic microorganisms 
such as fungi, some algae and protozoa. Polynucleotides may be isolated from environmental 
samples in which case the nucleic acid may be recovered without culturing of an organism or 
recovered from one or more cultured organisms. In one aspect, such microorganisms may be 
extremophiles, such as hyperthermophiles, psychrophiles, psychrotrophs, halophiles, barophiles 
and acidophiles. Polynucleotides encoding enzymes isolated from extremophilic
microorganisms are particularly preferred. Such enzymes may function at temperatures above 
100°C in terrestrial hot springs and deep sea thermal vents, at temperatures below 0°C in arctic 
waters, in the saturated salt environment of the Dead Sea, at pH values around 0 in coal deposits 
and geothermal sulfur-rich springs, or at pH values greater than 11 in sewage sludge. For
example, several esterases and lipases cloned and expressed from extremophilic organisms show 
high activity throughout a wide range of temperatures and pHs. 

Polynucleotides selected and isolated as hereinabove described are introduced into a 
suitable host cell. A suitable host cell is any cell which is capable of promoting recombination 
and/or reductive reassortment. The selected polynucleotides are preferably already in a vector 
which includes appropriate control sequences. The host cell can be a higher eukaryotic cell, such 
as a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or preferably, the host cell 
can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell 
can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, or
electroporation (Davis et al., 1986).

As representative examples of appropriate hosts, there may be mentioned: bacterial cells, 
such as E. coli, Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells 
such as Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, COS or Bowes
melanoma; adenoviruses; and plant cells. The selection of an appropriate host is deemed to be 
within the scope of those skilled in the art from the teachings herein. 

With particular reference to various mammalian cell culture systems that can be
employed to express recombinant protein, examples of mammalian expression systems include 
the COS-7 lines of monkey kidney fibroblasts, described in "SV40-transformed simian cells 
support the replication of early SV40 mutants" (Gluzman, 1981), and other cell lines capable of 
expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor and 
acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed sequences. 
DNA sequences derived from the SV40 splice and polyadenylation sites may be used to provide
the required nontranscribed genetic elements. 

Host cells containing the polynucleotides of interest can be cultured in conventional 
nutrient media modified as appropriate for activating promoters, selecting transformants or 
amplifying genes. The culture conditions, such as temperature, pH and the like, are those 
previously used with the host cell selected for expression, and will be apparent to the ordinarily 
skilled artisan. The clones which are identified as having the specified enzyme activity may then 
be sequenced to identify the polynucleotide sequence encoding an enzyme having the enhanced 
activity. 

In another aspect, it is envisioned the method of the present invention can be used to 
generate novel polynucleotides encoding biochemical pathways from one or more operons or 
gene clusters or portions thereof. For example, bacteria and many eukaryotes have a coordinated 
mechanism for regulating genes whose products are involved in related processes. The genes are
clustered, in structures referred to as "gene clusters," on a single chromosome and are transcribed
together under the control of a single regulatory sequence, including a single promoter which
initiates transcription of the entire cluster. Thus, a gene cluster is a group of adjacent genes that
are either identical or related, usually as to their function. An example of a biochemical pathway
encoded by gene clusters is polyketide biosynthesis. Polyketides are molecules which are an
extremely rich source of bioactivities, including antibiotics (such as tetracyclines and
erythromycin), anti-cancer agents (daunomycin), immunosuppressants (FK506 and rapamycin),
and veterinary products (monensin). Many polyketides (produced by polyketide synthases) are
valuable as therapeutic agents. Polyketide synthases are multifunctional enzymes that catalyze
the biosynthesis of an enormous variety of carbon chains differing in length and patterns of
functionality and cyclization. Polyketide synthase genes fall into gene clusters and at least one
type (designated type I) of polyketide synthases has large size genes and enzymes, complicating
genetic manipulation and in vitro studies of these genes/proteins.

Gene cluster DNA can be isolated from different organisms and ligated into vectors,
particularly vectors containing expression regulatory sequences which can control and regulate
the production of a detectable protein or protein-related array activity from the ligated gene
clusters. Vectors which have an exceptionally large capacity for exogenous DNA
introduction are particularly appropriate for use with such gene clusters and are described by way
of example herein to include the f-factor (or fertility factor) of E. coli. This f-factor of E. coli is
a plasmid which effects high-frequency transfer of itself during conjugation and is ideal to
achieve and stably propagate large DNA fragments, such as gene clusters from mixed microbial
samples. A particularly preferred embodiment is to use cloning vectors, referred to as "fosmids"
or bacterial artificial chromosome (BAC) vectors. These are derived from E. coli f-factor which
is able to stably integrate large segments of genomic DNA. When integrated with DNA from a
mixed uncultured environmental sample, this makes it possible to achieve large genomic
fragments in the form of a stable "environmental DNA library." Another type of vector for use
in the present invention is a cosmid vector. Cosmid vectors were originally designed to clone
and propagate large segments of genomic DNA. Cloning into cosmid vectors is described in
detail in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring
Harbor Laboratory Press (1989). Once ligated into an appropriate vector, two or more vectors
containing different polyketide synthase gene clusters can be introduced into a suitable host cell.
Regions of partial sequence homology shared by the gene clusters will promote processes which
result in sequence reorganization resulting in a hybrid gene cluster. The novel hybrid gene
cluster can then be screened for enhanced activities not found in the original gene clusters.

Therefore, in one embodiment, the invention relates to a method for producing a
biologically active hybrid polypeptide and screening such a polypeptide for enhanced activity by:
1) introducing at least a first polynucleotide in operable linkage and a second
polynucleotide in operable linkage, said at least first polynucleotide and second
polynucleotide sharing at least one region of partial sequence homology, into a
suitable host cell;
2) growing the host cell under conditions which promote sequence reorganization
resulting in a hybrid polynucleotide in operable linkage;
3) expressing a hybrid polypeptide encoded by the hybrid polynucleotide;
4) screening the hybrid polypeptide under conditions which promote identification of
enhanced biological activity; and
5) isolating a polynucleotide encoding the hybrid polypeptide.

Methods for screening for various enzyme activities are known to those of skill in the art
and are discussed throughout the present specification. Such methods may be employed when
isolating the polypeptides and polynucleotides of the invention.

As representative examples of expression vectors which may be used, there may be
mentioned viral particles, baculovirus, phage, plasmids, phagemids, cosmids, fosmids, bacterial
artificial chromosomes, viral DNA (e.g., vaccinia, adenovirus, fowl pox virus, pseudorabies and
derivatives of SV40), P1-based artificial chromosomes, yeast plasmids, yeast artificial
chromosomes, and any other vectors specific for specific hosts of interest (such as bacillus,
aspergillus and yeast). Thus, for example, the DNA may be included in any one of a variety of
expression vectors for expressing a polypeptide. Such vectors include chromosomal,
nonchromosomal and synthetic DNA sequences. Large numbers of suitable vectors are known
to those of skill in the art, and are commercially available. The following vectors are provided
by way of example; Bacterial: pQE vectors (Qiagen), pBluescript plasmids, pNH vectors,
lambda-ZAP vectors (Stratagene); ptrc99a, pKK223-3, pDR540, pRIT2T (Pharmacia);
Eukaryotic: pXT1, pSG5 (Stratagene), pSVK3, pBPV, pMSG, pSVLSV40 (Pharmacia).
However, any other plasmid or other vector may be used so long as they are replicable and viable
in the host. Low copy number or high copy number vectors may be employed with the present
invention.

The DNA sequence in the expression vector is operatively linked to an appropriate 

expression control sequence(s) (promoter)

promoters include lacI,

mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the
level of ordinary skill in the art. The expression vector also contains a ribosome binding site for
translation initiation and a transcription terminator. The vector may also include appropriate
sequences for amplifying expression. Promoter regions can be selected from any desired gene
using chloramphenicol transferase

addition, the expression vectors preferably contain one or more selectable marker genes to
provide a phenotypic trait for selection of transformed host cells such

neomycin resistance for eukaryotic

in E. coli.

In vivo reassortment is focused on "inter-molecular" processes collectively referred to as
"recombination" which in bacteria, is generally viewed as a "RecA-dependent" phenomenon.
The invention can rely on recombination processes of a host cell to recombine and re-assort
sequences, or the cells' ability to mediate reductive processes to decrease the complexity of
quasi-repeated sequences in the cell by deletion. This process of "reductive reassortment" occurs
by an "intra-molecular", RecA-independent process.

Therefore, in another aspect of the invention, novel polynucleotides can be generated by
the process of reductive reassortment. The method involves the generation of constructs

appropriate vector, and their subsequent introduction into an appropriate host cell. The
consecutive sequences in the construct possessing regions of homology, or between

of the repeated sequences,

Repeated or "quasi-repeated" sequences play a role in genetic instability. In the present
invention « «ep^^ «»t^ ^ <^ «^ ^ -"^ 

molecular level. The deletion process the cell performs to reduce the complexity of the
repetitive units.

head to tail or vice versa, the cell cannot distinguish individual units. Consequently, the

the quasi-repeated sequences in the same orientation to allow higher efficiency.

Sequences can be assembled in a head to tail orientation using any of a variety of
methods, including the following:

a) Primers that include a poly-A head and poly-T tail which when made
single-stranded would provide orientation can be utilized. This is accomplished by having
the first few bases of the primers made from RNA and hence easily removed by
RNAseH.

b) Primers that include unique restriction cleavage sites can be utilized. Multiple sites,
a battery of unique sequences, and repeated synthesis and ligation steps would be
required.

c) The inner few bases of the primer could be thiolated and an exonuclease used to
produce properly tailed molecules.

The recovery of the re-assorted sequences relies on the identification of cloning vectors

with reduced RI can be effected by:

1) The use of vectors only stably maintained when the construct is reduced in complexity.

2) The physical recovery of shortened vectors by physical procedures. In this case, the
cloning vector would be recovered using standard plasmid isolation procedures and size
fractionated on either an agarose gel, or column with a low molecular weight cut off
utilizing standard procedures.

3) The recovery of vectors containing interrupted genes which can be selected when insert
size decreases.

4) The use of direct selection techniques with an expression vector and the appropriate
selection.
Encoding sequences (for example, genes) from related organisms may demonstrate a high
degree of homology and encode quite diverse protein products. These types of sequences are
particularly useful in the present invention as quasi-repeats. However, while the examples
illustrated below demonstrate the reassortment of nearly identical original encoding sequences
(quasi-repeats), this process is not limited to such nearly identical repeats.

The following example demonstrates a method of the invention. Encoding nucleic acid
sequences (quasi-repeats) derived from three (3) unique species are described. Each sequence
encodes a protein with a distinct set of properties. Each of the sequences differs by a single or a
few base pairs at a unique position in the sequence. The quasi-repeated sequences are separately
or collectively amplified and ligated into random assemblies such that all possible permutations
and combinations are available in the population of ligated molecules. The number of
quasi-repeat units can be controlled by the assembly conditions. The average number of
quasi-repeated units in a construct is defined as the repetitive index (RI).
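
The definition of the repetitive index lends itself to a brief computational illustration. The following Python sketch is offered under stated assumptions and is not the disclosed procedure: ligation into random assemblies is modeled as drawing units uniformly at random, the upper limit on units per construct stands in for "assembly conditions", and all function names, unit labels, and parameter values are hypothetical.

```python
import random
from statistics import mean


def assemble_constructs(units, n_constructs, max_units, seed=0):
    """Model random ligation: each construct is a random-length list of
    quasi-repeat units drawn uniformly from `units`."""
    rng = random.Random(seed)
    constructs = []
    for _ in range(n_constructs):
        n_units = rng.randint(1, max_units)  # inclusive on both ends
        constructs.append([rng.choice(units) for _ in range(n_units)])
    return constructs


def repetitive_index(constructs):
    """Average number of quasi-repeat units per construct (RI)."""
    if not constructs:
        raise ValueError("at least one construct is required")
    return mean([len(construct) for construct in constructs])


# Hypothetical quasi-repeats from three species, differing by one base.
units = ["ATGCAT", "ATGGAT", "ATGTAT"]
population = assemble_constructs(units, n_constructs=50, max_units=6)
print(repetitive_index(population))
```

In this model, lowering `max_units` lowers the RI of the population, which mirrors the statement that the number of quasi-repeat units can be controlled by the assembly conditions.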

Once formed, the constructs may, or may not be size fractionated on an agarose gel
according to published protocols, inserted into a cloning vector, and transfected into an
appropriate host cell. The cells are then propagated and "reductive reassortment" is effected.
The rate of the reductive reassortment process may be stimulated by the introduction of DNA
damage if desired. Whether the reduction in RI is mediated by deletion formation between
repeated sequences by an "intra-molecular" mechanism, or mediated by recombination-like
events through "inter-molecular" mechanisms is immaterial. The end result is a reassortment of
the molecules into all possible combinations.
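
As a purely illustrative model of how a deletion between repeated sequences reduces RI, the Python sketch below joins two quasi-repeat units of a construct at a chosen crossover position and discards everything between them. This is a simplification assumed for the example, not a description of the cellular mechanism; the function name, the equal-length requirement, and the example units are hypothetical.

```python
def reductive_deletion(construct, i, j, crossover):
    """Join unit i and unit j (i < j) at `crossover`, deleting the units
    in between, and return the shortened construct."""
    if not 0 <= i < j < len(construct):
        raise ValueError("need 0 <= i < j < len(construct)")
    left, right = construct[i], construct[j]
    if len(left) != len(right):
        raise ValueError("quasi-repeat units must have equal length")
    if not 0 <= crossover <= len(left):
        raise ValueError("crossover must lie within the unit")
    # The hybrid unit carries sequence from both parental units.
    hybrid = left[:crossover] + right[crossover:]
    return construct[:i] + [hybrid] + construct[j + 1:]


# Hypothetical construct of three quasi-repeat units.
construct = ["AAAA", "CCCC", "GGGG"]
reduced = reductive_deletion(construct, 0, 2, crossover=2)
print(len(construct), len(reduced), reduced)
```

Each such event lowers the unit count of the construct by j - i and yields a unit that combines sequence from two of the original units.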

Optionally, the method comprises the additional step of screening the library members of
the shuffled pool to identify individual shuffled library members having the ability to bind or

with a predetermined macromolecule, such as for example a proteinaceous receptor, an
oligosaccharide, virion, or other predetermined compound or structure.

The polypeptides that are identified from such libraries can be used for therapeutic,
diagnostic, research and related purposes (e.g., catalysts, solutes for increasing osmolality of an 
aqueous solution, and the like), and/or can be subjected to one or more additional cycles of 
shuffling and/or selection. 

In another aspect, it is envisioned that prior to or during recombination or reassortment,
polynucleotides generated by the method of the invention can be subjected to agents or processes
which promote the introduction of mutations into the original polynucleotides. The introduction
of such mutations would increase the diversity of resulting hybrid polynucleotides and
polypeptides encoded therefrom. The agents or processes which promote mutagenesis can
include, but are not limited to: (+)-CC-1065, or a synthetic analog such as
(+)-CC-1065-(N3-Adenine) (see Sun and Hurley, (1992)); an N-acetylated or deacetylated
4'-fluoro-4-aminobiphenyl adduct capable of inhibiting DNA synthesis (see, for example, van de
Poll et al. (1992)); or an N-acetylated or deacetylated 4-aminobiphenyl adduct capable of
inhibiting DNA synthesis (see also, van de Poll et al. (1992), pp. 751-758); trivalent chromium, a
divalent chromium salt, a polycyclic aromatic hydrocarbon (PAH) DNA adduct capable of
inhibiting DNA replication, such as 7-bromomethyl-benz[a]anthracene ("BMA"),
tris(2,3-dibromopropyl)phosphate ("Tris-BP"), 1,2-dibromo-3-chloropropane ("DBCP"),
2-bromoacrolein (2BA), benzo[a]pyrene-7,8-dihydrodiol-9-10-epoxide ("BPDE"), a platinum(II)
halogen salt, N-hydroxy-2-amino-3-methylimidazo[4,5-f]-quinoline ("N-hydroxy-IQ"), and
N-hydroxy-2-amino-1-methyl-6-phenylimidazo[4,5-f]-pyridine ("N-hydroxy-PhIP"). Especially
preferred means for slowing or halting PCR amplification consist of UV light, (+)-CC-1065 and
(+)-CC-1065-(N3-Adenine). Particularly encompassed means are DNA adducts or polynucleotides
comprising the DNA adducts from the polynucleotides or polynucleotides pool, which can be
released or removed by a process including heating the solution comprising the polynucleotides
prior to further processing.

In another aspect the invention is directed to a method of producing recombinant proteins
having biological activity by treating a sample comprising double-stranded template
polynucleotides encoding a wild-type protein under conditions according to the invention which
provide for the production of hybrid or re-assorted polynucleotides.

The invention also provides for the use of proprietary codon primers (containing a
degenerate N,N,N sequence) to introduce point mutations into a polynucleotide, so as to generate
a set of progeny polypeptides in which a full range of single amino acid substitutions is
represented at each amino acid position (gene site saturated mutagenesis (GSSM)). The oligos
used are comprised contiguously of a first homologous sequence, a degenerate N,N,N sequence,
and preferably but not necessarily a second homologous sequence. The downstream progeny
translation products from the use of such oligos include all possible amino acid changes at each
amino acid site along the polypeptide, because the degeneracy of the N,N,N sequence includes
codons for all 20 amino acids.

In one aspect, one such degenerate oligo (comprised of one degenerate N,N,N cassette) is
used for subjecting each original codon in a parental polynucleotide template to a full range of
codon substitutions. In another aspect, at least two degenerate N,N,N cassettes are used, either
in the same oligo or not, for subjecting at least two original codons in a parental polynucleotide
template to a full range of codon substitutions. Thus, more than one N,N,N sequence can be
contained in one oligo to introduce amino acid mutations at more than one site. This plurality of
N,N,N sequences can be directly contiguous, or separated by one or more additional nucleotide
sequence(s). In another aspect, oligos serviceable for introducing additions and deletions can be
used either alone or in combination with the codons containing an N,N,N sequence, to introduce
any combination or permutation of amino acid additions, deletions, and/or substitutions.
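
The oligo layout described above (a first homologous sequence, a degenerate cassette, and an optional second homologous sequence) can be sketched as follows. This is a minimal illustration only: the function name, the flank length and the example template are hypothetical and are not taken from the specification.

```python
def saturation_oligos(coding_seq, flank=15, cassette="NNN"):
    """Build one degenerate oligo per codon of `coding_seq`.

    Each oligo is: upstream homologous flank + degenerate cassette +
    downstream homologous flank. Flanks are truncated at the template ends.
    Returns a dict mapping 1-based codon number to oligo sequence.
    """
    if len(coding_seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    oligos = {}
    for start in range(0, len(coding_seq), 3):
        upstream = coding_seq[max(0, start - flank):start]
        downstream = coding_seq[start + 3:start + 3 + flank]
        oligos[start // 3 + 1] = upstream + cassette + downstream
    return oligos

# Hypothetical 6-codon template, used only to show the layout.
template = "ATGGCTAAAGGTCTGTAA"
oligos = saturation_oligos(template, flank=6)
```

Passing a different `cassette` string (for example a less degenerate triplet) changes only the central element; the homologous flanks are unaffected.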

In a particular exemplification, it is possible to simultaneously mutagenize two or more
contiguous amino acid positions using an oligo that contains contiguous N,N,N triplets, i.e. a
degenerate (N,N,N)n sequence.

In another aspect, the present invention provides for the use of degenerate cassettes
having less degeneracy than the N,N,N sequence. For example, it may be desirable in some
instances to use (e.g., in an oligo) a degenerate triplet sequence comprised of only one N, where
said N can be in the first, second or third position of the triplet. Any other bases including any
combinations and permutations thereof can be used in the remaining two positions of the triplet.
Alternatively, it may be desirable in some instances to use (e.g., in an oligo) a degenerate N,N,N
triplet sequence, N,N,G/T, or an N,N,G/C triplet sequence.

It is appreciated, however, that the use of a degenerate triplet (such as N,N,G/T or an
N,N,G/C triplet sequence) as disclosed in the instant invention is advantageous for several
reasons. In one aspect, this invention provides a means to systematically and fairly easily
generate the substitution of the full range of possible amino acids (for a total of 20 amino acids)
into each and every amino acid position in a polypeptide. Thus, for a 100 amino acid
polypeptide, the invention provides a way to systematically and fairly easily generate 2000
distinct species (i.e., 20 possible amino acids per position times 100 amino acid positions). It is
appreciated that there is provided, through the use of an oligo containing a degenerate N,N,G/T
or an N,N,G/C triplet sequence, 32 individual sequences that code for 20 possible amino acids.
Thus, in a reaction vessel in which a parental polynucleotide sequence is subjected to saturation
mutagenesis using one such oligo, there are generated 32 distinct progeny polynucleotides
encoding 20 distinct polypeptides. In contrast, the use of a non-degenerate oligo in site-directed
mutagenesis leads to only one progeny polypeptide product per reaction vessel.
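
The figure of 32 sequences coding for 20 amino acids can be checked by enumeration. The sketch below assumes the standard genetic code and expands the N,N,G/T and N,N,G/C triplets; the function and variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def expand_triplet(first, second, third):
    """List every codon allowed by the bases permitted at each position."""
    return ["".join(c) for c in product(first, second, third)]

def summarize(codons):
    """Return (codon count, sorted amino acids encoded, sorted stop codons)."""
    encoded = {CODON_TABLE[c] for c in codons}
    amino_acids = sorted(encoded - {"*"})
    stops = sorted(c for c in codons if CODON_TABLE[c] == "*")
    return len(codons), amino_acids, stops

nnk = expand_triplet("ACGT", "ACGT", "GT")  # N,N,G/T
nns = expand_triplet("ACGT", "ACGT", "GC")  # N,N,G/C

nnk_count, nnk_amino_acids, nnk_stops = summarize(nnk)
nns_count, nns_amino_acids, nns_stops = summarize(nns)

# Species count for a 100 amino acid polypeptide: 20 substitutions per position.
species_for_100_residues = 20 * 100
```

Under the standard code, each of the two degenerate triplets also includes a single stop codon (TAG) among its 32 members, which is a practical consideration when screening progeny.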

This invention also provides for the use of nondegenerate oligos, which can optionally be
used in combination with degenerate primers disclosed. It is appreciated that in some situations,
it is advantageous to use nondegenerate oligos to generate specific point mutations in a working
polynucleotide. This provides a means to generate specific silent point mutations, point
mutations leading to corresponding amino acid changes, and point mutations that cause the
generation of stop codons and the corresponding expression of polypeptide fragments.

Thus, in a preferred embodiment of this invention, each saturation mutagenesis reaction
vessel contains polynucleotides encoding at least 20 progeny polypeptide molecules such that all
20 amino acids are represented at the one specific amino acid position corresponding to the
codon position mutagenized in the parental polynucleotide. The 32-fold degenerate progeny
polypeptides generated from each saturation mutagenesis reaction vessel can be subjected to
clonal amplification (e.g., cloned into a suitable E. coli host using an expression vector) and
subjected to expression screening. When an individual progeny polypeptide is identified by
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therein. 






mutations (i.e., 2 at each of three positions) and no change at any position.
chimerization, recombination and other mutagenizing processes, along with screening. This
mutagenizing process(es) is used in combination with screening.

polynucleotide is generated by recombination and reductive reassortment.
invention provides that mutagenesis can be used to replace each of any number of bases in a

from 15 to 100,000. Thus, instead of mutagenizing every position along a molecule, one






GrayCary\GT\6228967.1 
104703-156590 



of positions along a polynucleotide sequence. A group of 3 positions to be mutagenized may be
a codon. The mutations are preferably introduced using a mutagenic primer, containing a

1 to 500 bases. Each nucleotide position

A/G, A/T, C/G, C/T, G/T, C/G/T, A/G/T, A/C/T, A/C/G, or E, where E is any base that is not A,
C, G, or T (E can be referred to as a designer oligo).

polylleo,idese q uence.obe m u.^ ft 

^Utionstoheintroducedintoasecondcassetteduringtheapphcationofoneroundof 

particular codons, and groupings of particular nucleonde cassettes. 

Defined sequences to be mutagenized include a whole gene, pathway, cDNA, an entire
open reading frame (ORF), and entire promoter,

include typesof amino acids encoded by a degenerate mutagenic cassette. 

(using degenerate oligos) that code for 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,
and 20 amino acids at each position, and a library of polypeptides encoded thereby.

thereto or a fragment comprising at least 10, 15, 20, 25,

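The isolated nucleic acids may comprise DNA, including cDNA, genomic DNA, and synthetic
DNA. The DNA may be double-stranded or single-stranded, and if single stranded may be the
coding strand or non-coding (anti-sense) strand. Alternatively, the isolated nucleic acids may
comprise RNA.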

20, 25, 30, 35, 40, 50, 75, 100, or 150 conserve amino acids of SEQ ID NO.. 
substantially identical thereto. 

Accordingly, another aspect of the invention is an isolated nucleic acid which encodes

,0 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of SEQ ID 

:; b ::ri::i:: <~«~ 

1997, the disclosure of which is incorporated herein by reference. 
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coding sequence. 






Alternatively, the nucleic acid sequence of SEQ ID NO.: 3, and sequences substantially

200, 300, 400, or 500 consecutive bases of SEQ ID NO.: 3,
stringency as provided herein.

30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases of SEQ ID NO.: 3,

sample. The nucleic acids are contacted with the probe under conditions which permit the probe
to specifically hybridize to any complementary sequences which are present therein.

Where necessary, conditions which permit the probe to specifically hybridize to
complementary sequences may be determined by placing the probe in contact with
complementary sequences from samples known to contain the complementary sequence as well
as control sequences which do not contain the complementary sequence. Hybridization
conditions, such as the salt concentration of the hybridization buffer, the formamide
concentration of the hybridization buffer, or the hybridization temperature, may be varied to
identify conditions which allow the probe to hybridize specifically to complementary nucleic
acids.

If the sample contains the organism from which the nucleic acid was isolated, specific
hybridization of the probe is then detected. Hybridization may be detected by labeling the probe
with a detectable agent such as a radioactive isotope, a fluorescent dye or an enzyme capable of
catalyzing the formation of a detectable product.

Many methods for using the labeled probes to detect the presence of complementary
nucleic acids in a sample are familiar to those skilled in the art. These include Southern Blots,
Northern Blots, colony hybridization procedures, and dot blots. Protocols for each of these
procedures are provided in Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
Sons, Inc. (1997) and Sambrook et al., Molecular Cloning: A Laboratory Manual 2nd Ed., Cold
Spring Harbor Laboratory Press (1989), the entire disclosures of which are incorporated herein by
reference.

Alternatively, more than one probe (at least one of which is capable of specifically
hybridizing to any complementary sequences which are present in the nucleic acid sample), may
be used in an amplification reaction to determine whether the sample contains an organism
containing a nucleic acid sequence of the invention (e.g., an organism from which the nucleic
acid was isolated). Typically, the probes comprise oligonucleotides. In one embodiment, the
amplification reaction may comprise a PCR reaction. PCR protocols are described in Ausubel

3SR, or strand displacement reaction. (See Barany, F., "The Ligase Chain Reaction in a PCR
World", PCR Methods and Applications 1:5-16, 1991; E. Fahy et al., "Self-sustained Sequence
Replication (3SR): An Isothermal Transcription-based Amplification System Alternative to PCR",

Amplification-an Isothermal

20:1691-1696, 1992, the disclosures of which are incorporated herein by reference in their
amplification reaction is performed,

amplification product may be detected by performing gel electrophoresis on the reaction products

product may be detected by autoradiography after gel electrophoresis.

Probes derived from sequences near the ends of SEQ ID NO.: 3, and sequences
substantially identical thereto. Such methods allow isolation of genes which encode additional
proteins from the host organism.

The isolated nucleic acids of SEQ ID NO.: 3, and sequences substantially identical
thereto, the sequences complementary thereto, or a fragment comprising at least 10, 15, 20, 25,
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases of SEQ ID NO.: 3, and
sequences substantially identical thereto, may be used as probes to identify and isolate related
nucleic acids. The related nucleic acids may be cDNAs or genomic DNAs from organisms other
than the one from which the nucleic acid was isolated. For example, the other organisms may be
related organisms. In such procedures, a nucleic acid sample is contacted with the probe under
conditions which permit the probe to specifically hybridize to related sequences. Hybridization
of the probe to nucleic acids from the related organism is then detected using any of the methods
described above.
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whether one of the nucleic acids is immobilized, for example, on a filter. 

Hybridization may be carricdout under conditions ofiow stringency, moderate 
a., ™ examDle of nucleic acid hybridization, a polymer 

05% SDS iOXDenhardfs.andO.Smg/mlpolyriboadenylicacid. App—ely 2 X 10 P» 
f ,49X10' cpm/ug) of -P end-labeled oligonucleotide probe are then added to 
the solution. After 12-16 hours 01 in Na 2 EDTA) 
♦ ■ • an W SDS followedbya30minutewashinfreshlXSM at i m iu 

hybridization signals.

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is
calculated using the formula: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C) - (600/N), where
N is the length of the probe.

If the hybridization is carried out in a solution containing formamide, the melting
temperature may be calculated using the equation: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction
G+C) - (0.63% formamide) - (600/N), where N is the length of the probe.
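
As a worked illustration, the empirical melting temperature relation can be written as a short function. This sketch assumes the commonly used form Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 0.63(% formamide) - 600/N, with the G+C and formamide terms expressed as percentages; the function name and the example probe are hypothetical.

```python
import math

def melting_temperature(probe, sodium_molar, formamide_percent=0.0):
    """Estimate Tm in degrees C for an oligonucleotide probe.

    Assumed form: 81.5 + 16.6*log10([Na+]) + 0.41*(%G+C)
                  - 0.63*(% formamide) - 600/N, where N is the probe length.
    """
    probe = probe.upper()
    n = len(probe)
    if n == 0:
        raise ValueError("probe must not be empty")
    gc_percent = 100.0 * sum(base in "GC" for base in probe) / n
    return (81.5
            + 16.6 * math.log10(sodium_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / n)

# Hypothetical 20-mer with 50% G+C, in 1 M Na+.
probe = "ATGCATGCATGCATGCATGC"
tm_aqueous = melting_temperature(probe, sodium_molar=1.0)
tm_formamide = melting_temperature(probe, sodium_molar=1.0, formamide_percent=50.0)
```

With these inputs the sodium term vanishes (log10 of 1 is 0), so the aqueous estimate is 81.5 + 20.5 - 30 and the 50% formamide estimate is lower by 0.63 x 50.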

Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 ug
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and Denhardt's
solutions are listed in Sambrook et al., supra.

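complementary thereto or homologous thereto. For probes over 200 nucleotides in length, the
hybridization may be carried out at 15-25°C below the Tm. For shorter probes, such as
oligonucleotide probes, the hybridization may be conducted at 5-10°C below the Tm. Typically, for
hybridizations in 6X SSC, the hybridization is conducted at approximately 68°C. Usually, for
hybridizations in 50% formamide containing solutions, the hybridization is conducted at
approximately 42°C.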

All of the foregoing hybridizations would be considered to be under conditions of high
stringency.



Following hybridization, the filter is washed to remove any non-specifically bound
detectable probe. The stringency used to wash the filters can also be varied depending on the

the nucleic acid type (e.g., RNA v. DNA). Examples of progressively higher stringency
condition washes are as follows: 2X SSC, 0.1% SDS at room temperature for 15 minutes (low
stringency); 0.1X SSC, 0.5% SDS at room temperature for 30 minutes to 1 hour (moderate
stringency); and 0.15M NaCl for 15 minutes at 72°C (very high stringency). The examples above

would know that there are numerous recipes for different stringency washes.
are given below.

conventional techniques. 

homology to the probe sequence. For example, to obtain nucleic acids of decreasing homology to
the detectable probe, less stringent conditions may be used. For example, hybridization

Na+ concentration of approximately 1M. Following hybridization, the filter may be
washed with 2X SSC, 0.5% SDS at the temperature of hybridization.
considered to be "moderate" conditions above 50°C and "low" conditions below 50°C. A specific
example of "moderate" hybridization conditions is when the above

hybridization is conducted at 45°C.

of 42°C. In this case, the concentration of formamide in the
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133 



conducted at 10% formamide. 
seq— tlea,^^^ 

===53=====* 

15, 20, 25, 30, 35, 40, 50, 75, 100, 150, zw, , alignm ent algorithm. 

75/., at least nf<!F oiDNO • 4, and sequences substantially identical 

0 polypeptide having the sequence of SEQ | < 

IU ,hereto,orfragmen.scompris,ngat 1 east 5 ,10,15 > 20 > 25 > 30 ; ' h 

ft con ^^^^-««^^•^^■ I, ^ 

as the FASTA version 3.0t78 algorithm with the default parameters).

For example, the expression ^^^'^^^^ 
30 translation im to The vector m y 

sequences for amplifying expression.

phosphatase promoter. Fungal promoters include the α factor promoter.

metallothionein-I promoter. Other promoters known to control expression of genes in
prokaryotic or eukaryotic cells or their viruses may also be used.

—on scuences, and 5' flanking nontranscrihed sciences. In son. em tata-* ON 
required nontranscribed genetic elements. 

Vectors for expressing the polypeptide or fragment thereof in eukaryotic cells may also
usually from about 10 to about 300 bp in length that act on a promoter to increase its

100 to 270, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side
of the replication origin, and the adenovirus enhancers.

' .operm.tselectionofhostcenscontainingthevecto, Such selectable markers 



TRP1 gene. 
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Insom ee m W im e» t s, fll e„uole i cao id enco dl n g SEQ I ONO. 4 «ces 

^ 40 50 75 100 or 150 consecutive amino acids thereol is raseu 
such as increased stability or simplified purificabon. 

The appropriate DNA sequence may be inserted into the vector by a variety of
procedures. In general, the DNA sequence is ligated to the desired position in the vector

. + v mm the entire disclosures of which are incorporated herein by 

art. 

The vector may be, for example, in the form of a plasmid, a viral particle, or a phage.

Othervctlldeohr — 

, u nxr A viral DNA such as vaccinia, adenovirus, fowl pox 

Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby
incorporated by reference.

vector may be used as long as it is replicable and viable in the host cell,
skilled in the art.

gene transfer. Particular methods include calcium phosphate transfection, DEAE-Dextran

Methods in Molecular Biology, (1986)).

of the invention. Following transformation of a suitable host strain and growth of the
means (e.g., temperature shift or chemical induction) and the cells may be cultured for an

and the resulting crude extract is retained for further purification. Microbial cells employed for
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phosphocellulosechromatog^^ 

Various mammalian cell culture systems can also be employed to express recombinant
protein.

fibroblasts (described by Gluzman, Cell, 23:175, 1981), and other cell lines capable of
expressing proteins from a compatible vector, such as the C127, 3T3, CHO, HeLa and BHK cell
lines.



also include an initial methionine amino acid residue. 

■ ■ , , , S !0 15 20 25 30, 35, 40, 50, 75, 100, or 150 consecutive ammo adds 
comprising at least 5, 10, 15, ZU, a, ju, , 

employed as intermediates for producing the full-length polypeptides. 
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DNA construct comprising a promoter operably linked l ° be^inearized 
the desired polypeptide or fragment thereof, 
consecutive amino acids thereoi. 1 nc ici 

J synthesis or modification procedures. 



natural isolates. 



30 



For example, variants may be created using error prone PCR. In error prone PCR, PCR

PCR product. For example, the reaction may be performed using 20 fmoles of nucleic acid to be
mutagenized, 30 pmole of each PCR primer, a reaction buffer comprising 50mM KCl, 10mM
Tris HCl (pH 8.3) and 0.01% gelatin, 7mM MgCl2, 0.5mM MnCl2, 5 units of Taq polymerase,

nucleic acids is evaluated.

Another method for generating variants is assembly PCR. Assembly PCR involves the
assembly of a PCR product from a mixture of small DNA fragments. A large number of different
PCR reactions occur in parallel in the same vial, with the products of one reaction priming the

July 9, 1996, entitled, "Method of DNA Reassembly by Interrupting Synthesis",
which is incorporated herein by reference in its entirety.
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highly related ^^^j™^ foii 0 wedby frxation of the crossover by primer extension 

based on sequence homology, followed by £1:10747- 

5 ^reaction. ^™<^* M *»Z^^.** 

proceduresapiurahtyofnuclerca ^° Fragm en,s of the desired average size 

^ntshavinganaverages, «0-20- ' ^.^^ 
are purified and resuspended m a PCR mixture, r 

, f v, HMTP 2 2mM MgC12, 50mM KCL, lOmM Tns HC1, pH y.u, an 
• Q oligonucleotrdesn^berncl^ 

i =— — — 

I activities of the polypeptides they encode are assessed. 
V— yalsooec^ 
bacterial strain, such as an E. coll strain, mutation rate than that of a wild- 

tyP e parent. Propagatmg the DNA mutag enesis are described 

•f^t^nNA Mutator strains suitable tor use ior in viv & 

30 herein by reference in its entirety. 
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and/or partially randomized native sequence. 

3 herein by reference in its entirety. 

-H Biotechnology Research, 1U548-155Z, , 

i== Current Opinion in Biotechnology, 4.450-455, 
herein by reference in its entirety. 

======= 

Having Desired Activities by Mutagenesis", both of which are incorporated herein by reference.

The variants of SEQ ID NO.: 4 may be variants in which one or more of the amino acid
residues of SEQ ID NO.: 4 are substituted with a conserved or non-conserved amino acid residue

not be one encoded by the genetic code.

Conservative substitutions are those that substitute a given amino acid in a polypeptide

Threonine or vice versa; replacement of an acidic residue such as Aspartic acid and Glutamic

residue such as Phenylalanine, Tyrosine with another aromatic residue.

Other variants are those in which one or more of the amino acid residues of SEQ ID NO.:
4 includes a substituent group.

compound, such as a compound to increase the half-life of the polypeptide (for example,
polyethylene glycol).

Additional variants are those in which additional amino acids are fused to the

sequence which facilitates

function or activity as SEQ ID NO.: 4, and sequences substantially identical thereto. In other
embodiments, the fragment, derivative, or analog includes a proprotein, such that the fragment,

polypeptide.


P 
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q 

compnsmg at least 5, 10,15,20,25,311, flhf , nrMrams described above which aligns 

.Hereof. Homology ^'^^''ZZZl*^*** 

the polypeptides or fragments being compared and determines the
similarity between them. It will be appreciated that amino acid "homology" includes
conservative amino acid substitutions such as those described above.

acids encoding them using the techniques described above.

Alternative y, Qpnuen ce of potentially homologous 

!Zi:; — 5 ,;o,l 5 , 2 0, 25 ,30,3MO, 5 0, 7 S,,00,orl 5 Oconsecutive.moac 1 ds 

thereof using any of the programs described above. 

Another aspect of the invention is an assay for identifying fragments or variants of SEQ
ID NO.: 4, and sequences substantially identical thereto, which retain the enzymatic function of
SEQ ID NO.: 4, and sequences substantially identical thereto. For example the fragments or

the fragment or variant retains the enzymatic activity of SEQ ID NO.: 4.



30 






polypeptide and substrate. 

, ,•„ of SEO ID NO • 4, and sequences substantially identieal thereto or 
Thepolypeptide fSEQIDNO m „ 150 eon.ecut.ve amino 

such procedures, a substance containing a glycoside linkage (e.g., a
starch) is contacted with SEQ ID NO.: 4,

conditions which facilitate the hydrolysis of the glycosidic linkage.

, t'M nfSE01DN0'4 and sequences substantially identical thereto or 
The polypeptide ° ^ ^ ^ 20 25 30 35, 40, 50, 75, 100, or .50 consecutive amino 

- til — ^ 

nabiological sample. In such procedures, a proton preparation, such as 

30 35 40 50 75, 100, or 150 consecutive amino acids thereof. 
30 conditionsinwhichtheantibodyspecificallybindstoSEQIDNO.. 



(2) 
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ft ™, m t thereof After a wash to remove non-specifically 
substantially identical thereto, or fragment thereof. After a wash to remove non-specifically
bound proteins, the specifically bound polypeptides are eluted.

include ELISA assays, sandwich assays, radioimmunoassays, and Western Blots.
consecutive amino acids thereol can nonhum an. The 

J from cells expressing that polypeptide. 

I .or.eparationo—ant^a,— e 

* produced by — ^ ^ ^ 

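technique (Kohler and Milstein, Nature, 256:495-497, 1975), the trioma technique, the human
B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72, 1983), and the
EBV-hybridoma technique (Cole, et al., 1985, in Monoclonal Antibodies and
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96, the disclosures of which are incorporated herein by
reference).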



produce single chain antibodies to SEQ ID NO.: 4,

or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive
amino acids thereof. Alternatively, transgenic mice may be used
antibodies to these polypeptides or fragments thereof.

, ■ „ SFO ID NO • 4, and sequences substantially identical 
Antibodies generated against ^ ^ ^ ^ „ 30 , 35 , 40, 50, 75, 100, or ,50 

consecutive amino acds thereof may be ^ ^ 

onanisms andsamples. In such techniques, ^ ^ ^ 

me antibody and those polypeptides which ^ 

| the nucleotide sequence ^ • 3 and fragments thereof and sequences complementary 

u sequenceshomoiogoustoSEQlDNO^ ^ ro m: 3. e-^ri*,- 

25 nucleic acid sequences as set forth in SEQ ID NO. correc ti 0 n of a sequencing 
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toetcorftagmentsofanyoffte 96% , 95 . /o , 9 0V„, 85%, 80%, 75%, 70%, 

65%,60%,55%,or50%homologytoSEQIDNO..4 * 3 0t78 with fhe 

aefaunpar^etersorwi.hanymod.fiedp— .^ehomo^ 



in a sequence. 



25 



subs^tiaUyidenttcattodcSEQIDNO.. , q teeon at least 2 , 

5, 10, 15, or 20 nucleic acid sequences as set forth in id 
identical thereto. 

computerread^emediumhavingr^rdedteeonSEQIDNO.^, 
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te eo„ at leas. 2.5.R15, or Motto sequences as set forth abov, 
media known to those skilled in the art. 

J N0 ,,a—— a — Xi::rL Pr oeess tag ,acces Stag 




no. 
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(< ,g.,vrathe internet) etc. Insome a CO mpact disk, a magnetic tape, etc. 

i* computer system 100. 



execution. 




but are by no means limited to,

(Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85(8):2444-2448, 1988; Altschul et al., J. Mol.
Biol. 215(3):403-410, 1990; Thompson et al., Nucleic Acids Res. 22(2):4673-4680, 1994;

410, 1990; Altschul et al., Nature Genetics 3:266-272, 1993).

Software Package of the Genetics Computer Group, University of Wisconsin
Biotechnology Center, 1710

£ similar sequences by assigning degrees of homology context of two or more nucleic acids or 
inspection. 

program parameters. 
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f rencesefluenceoLs^enumberofoontiguouspositionsafterAeWosequenoesaxe 
optmallyahgned.Metltodsotaig™ „. , ucted eg by the local homology 

offtesealgontostGAP.BEStoTF by man ual alignment and 

Biological Information), ALIGN, AMAS (Analysis of Multiply Aligned Sequences), AMPS
(Protein Multiple Sequence Alignment), ASSET (Aligned Segment Statistical Evaluation Tool),
BLIMPS (BLocks IMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V,
CONSENSUS, LCONSENSUS, WCONSENSUS, Smith-Waterman algorithm, DARWIN, Las
Vegas algorithm, DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis Package), GAP
(Global Alignment Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive Sequence
Comparison), LALIGN

■c liable as part of the Human Genome Sequencing Project (J. Roady^ 
genome is available as pan 01 Ji^^n^s 1995). At least 

uww he i u W a 3hing tettx da/- iu aclVhumo fi~ g e ne meyragt^^^M 

(Adams et al., 2000). Significant progress has also been made in sequencing
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5 krtrW/ww.ncbi.nlrnj u^ ' 




through the National Center for Biotechnology

identifying short words

neighborhood word score threshold (Altschul et al., supra).
word hits are extended in

For amino acid sequences, a scoring matrix is used to
calculate the cumulative score.
achieved value; the
cumulative score goes to zero or below, due to the

The BLAST algorithm parameters W,
T and X determine the sensitivity and speed of the alignment. The BLASTN program

M=5, N=-4, and a comparison of both strands.

The BLAST algorithm

sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90:5873, 1993). One measure



of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides



0.001. 



(4) TBLASTN compares a query protein sequence against a nucleotide
sequence database translated in all six reading frames (both strands); and
(5) TBLASTX compares the six-frame translations of a nucleotide query
sequence against the six-frame translations of a nucleotide sequence database.
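
To illustrate what "translated in all six reading frames (both strands)" means, the following sketch translates a nucleotide sequence in the three forward frames and the three frames of its reverse complement. It assumes the standard genetic code; the function names and the example sequence are hypothetical and are not part of any BLAST implementation.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def translate(seq):
    """Translate complete codons from the start of `seq`; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def six_frame_translation(seq):
    """Map frame labels (+1..+3, -1..-3) to their translations."""
    seq = seq.upper()
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames

frames = six_frame_translation("ATGAAATAG")
```

A search of this kind would then compare a protein query against each of the six translated strings rather than against the nucleotide sequence itself.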

The BLAST programs identify homologous sequences by identifying similar segments,
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic
acid sequence and a test sequence which is

by means

BLAST programs are accessible through the U.S. National Library of Medicine
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The parameters used with the above algorithms may be adapted depending 
P^isatWtagram — ^'^^^^ 
such as GENBANK that is available through the Internet. 

„,«.»b.«»i-«i""«'"'™"""™'^" . „ 
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, ■■ « tat e2 1 8whereinadeterminationismadewhethermoresequencesex,stmthe 



10 in the database. 



KshouidbenotedtMtifadeternnnatio „ had been made at the decision state 2! 24-*. 
9 andsequencessub—yid^ 

1 —-1 a llyl den, 1 ca lte eto,ada te s,o r a g edevice h av 1 n g — 

P ^cacidsequenceasset^htnSEQ^^^^ 

25 llintheabovedescribednucieicacidcodeofSEQIDNO^.ands^encessnbstant* 

25 moms m uw a sequences 
identicaltheretcorapolypeptidesequeneeassetforthmSEQlDNO.. « 

snbstantiallyidentical thereto, or it may identify strueturai m ot,fs ,n sequences wtach^e 

— -*-^™^?»-- 

s^de.cen^ayhavesto.d— 

30 more of the nucleic acid sequences as set forth in bbg id 
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^^«*. ^ ^^"-« taOT ^ ,,,d, '' , ■"" 

substantially identical thereto. 

Another aspeciuiu. sequences 
SEO ID NO • 3 and sequences substantially idenfcal thereto, or SEQ ID NO q 

enumeratedherem, (e.g.,BLAS12JN wi Themethodmay 

; nucleic acid sequences as set forth m SEQ ID NO.. V ^ 

^ fteSEOIDNO^throughuseofthccomputerprogramanddetermnunghomology 

sequences. 



(2D 



state 262 wherein the 



wnerem uw 



:m the tirsi cnaiaL^i v,* ^ - 

is a nucleotide sequence, then the character would normally be either A, T, C, G
or U. If the sequence is a protein sequence, then it is preferably in the
single letter amino acid code so that the first and second sequences can be easily compared.

^e.lf.heyarethesame.thentoprocess.Omovestoastate^wheremthcnex, 
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1 



next characters are the same. If they are, then

If a determination is made that the next two

characters in either sequence to read.
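
The character-by-character comparison described in this passage can be sketched as a simple loop. This is a hedged illustration only: it assumes the two sequences are already aligned position by position, and it reports the proportion of identical characters relative to the length of the first sequence. The function name is hypothetical.

```python
def homology_level(first, second):
    """Percent of positions at which two pre-aligned sequences are identical.

    Characters are read pairwise until either sequence runs out; the match
    count is expressed as a percentage of the first sequence's length.
    """
    if not first:
        raise ValueError("first sequence must not be empty")
    matches = 0
    for char_a, char_b in zip(first.upper(), second.upper()):
        if char_a == char_b:
            matches += 1
    return 100.0 * matches / len(first)

identical = homology_level("ACGTACGT", "ACGTACGT")
one_mismatch = homology_level("ACGT", "ACGA")
```

Practical comparison programs also introduce gaps to optimize the alignment before scoring, which this sketch deliberately omits.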

« q „ m .1b.l™»l«'"- l ™" l, " xl% 






sequence as set forth in SEQ ID NO.

forth in SEQ ID NO.: 3, and sequences substantially identical

nucleic acid sequence as set forth in SEQ ID NO.,

thereto, or a polypeptide sequence as set forth in SEQ ID NO.: 4
which identifies

identical thereto.
open reading frame in a nucleic acid sequence as set forth in SEQ ID NO.

substantially identical thereto.

Codon and the attribute would be "ATG". An example of such a

is produced by
polypeptide motifs such as alpha helices, beta
other motifs known to those skilled in the art.
Once the database of features is opened at the state 306, the process 300 moves to a state

from the database. A comparison of the attribute of the first
feature with the first sequence is then made at a state 310. A determination is then made at a

feature is displayed to the user.

The process 300 then moves to a decision state 320 wherein a determination is made

if any more features exist in the database.
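
The feature identification loop of process 300 may be sketched as follows, purely as an example. The `Feature` record, the function `find_features`, and the single database entry shown are hypothetical. Matching the attribute as a plain substring of the sequence is likewise an assumption of this sketch, not a statement of how any particular program performs the comparison.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    """Hypothetical database record: a feature name and its attribute."""
    name: str
    attribute: str


def find_features(sequence: str, database: List[Feature]) -> List[Tuple[str, int]]:
    """Return (feature name, 0-based position) for every attribute found."""
    sequence = sequence.upper()
    found = []
    # Read each feature from the database in turn until none remain.
    for feature in database:
        attribute = feature.attribute.upper()
        start = sequence.find(attribute)
        while start != -1:
            found.append((feature.name, start))
            # Continue searching just past the previous hit.
            start = sequence.find(attribute, start + 1)
    return found


if __name__ == "__main__":
    database = [Feature("Initiation Codon", "ATG")]
    for name, position in find_features("CCATGAAATGA", database):
        print(name, position)
```

Each hit corresponds to the step in which the name of a found feature is displayed to the user. Structural polypeptide motifs would need a richer attribute than a literal string.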

Accordingly,
sequences substantially identical

nucleic acid sequence as set forth in SEQ ID NO.
4, and sequences substantially

thereto, or a polypeptide

sequences as set forth in
sequences substantially identical thereto;

through the use of the computer program and identifying 
or polypeptide codes with the computer program. 



A nucleic acid sequence as set forth in SEQ ID NO.: 3, and sequences substantially

of formats. For example, a nucleic acid sequence as set forth in SEQ ID

SEQ ID NO., and sequences substantially identical thereto, may be stored as text in a word

comparison algorithms,

to a nucleic acid sequence as set forth in SEQ ID NO.
in SEQ ID NO.: 4, and sequences substantially

databases which are useful with the nucleic acid sequences as set forth in SEQ ID

SEQ ID NO.: 4, and sequences substantially identical thereto.

FASTA (Pearson and Lipman, Proc. Natl. Acad. Sci. USA, 85: 2444
Comp. App. Biosci.,

Molecular Simulations
given the present disclosure.
leucine zippers, helix

signal sequences encoding signal peptides which direct the secretion of the

The biocatalytic reactions produce a population of derivatives

These derivatives can be subjected to another round of biocatalytic reactions to

solution using virtually any type of screening assay. It is important
enzymatic reactions that

current chemical methods.

molecules, comprising contacting a polypeptide encoded by a polynucleotide

The biocatalytic reactions are conducted with a group






moiety. 



however, it is to be understood that the invention is not limited to such examples.

EXAMPLES

EXAMPLE 1 



Production of the Expression Gene Bank 

from λZAP2 to pBluescript. The clones
5, 16). The resulting colonies were picked

The wells contained 250 µL of LB media with
37°C without shaking. This constituted

culture of E. coli cells, each of which contained a pBluescript plasmid with a unique DNA insert.



EXAMPLE 2

Screening for Glycosidase Activity

GeneBank were used to multiply inoculate a single plate (the
"Condensed Plate") containing

of LB Amp/Meth, glycerol. This step was
performed using the High

Beckman Biomek with a 1%

the Condensed Plate thus contained 10 to
then used to inoculate two

Amp/Meth (without glycerol). The original
condensed daughter plates were incubated at 37°C for 18 h.
A "600 µM substrate stock solution" was prepared as follows: 25 mg of each of four

compounds

compounds used were 4-methylumbelliferyl

galactoside, 4-methylumbelliferyl

pyranoside. Two hundred fifty microliters of each DMSO solution was added to ca. 9 mL
Fifty µL of the '600 µM stock solution' was added to each of the wells of a white

were subtracted to determine if an active clone was present by an increase in fluorescence over
the majority of the other wells. 
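
The comparison of fluorescence readings described in this example can be illustrated with the short sketch below. It is offered only as one possible reading of the procedure: the function name `find_active_wells`, the use of the median increase to stand for "the majority of the other wells", and the `min_excess` cutoff are all assumptions, and the well readings are invented for the illustration.

```python
from statistics import median
from typing import Dict, List


def find_active_wells(
    initial: Dict[str, float],
    final: Dict[str, float],
    min_excess: float = 100.0,  # hypothetical cutoff, in fluorescence units
) -> List[str]:
    """Return wells whose fluorescence rose well above the typical rise."""
    # Subtract the initial reading from the final reading for every well.
    increases = {well: final[well] - initial[well] for well in initial}
    if not increases:
        return []
    # Assumption: the median increase represents the inactive majority.
    typical = median(increases.values())
    return sorted(
        well for well, rise in increases.items() if rise - typical > min_excess
    )


if __name__ == "__main__":
    # Invented readings for four wells of one plate.
    before = {"A1": 50.0, "A2": 52.0, "A3": 49.0, "A4": 51.0}
    after = {"A1": 55.0, "A2": 60.0, "A3": 900.0, "A4": 54.0}
    print(find_active_wells(before, after))
```

With the invented readings above, only well A3 is reported. The cutoff would need to be chosen for the instrument and plate actually used.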

EXAMPLE 3 

Isolation of Active Clone and Substrate Specificity Determination 

In order to isolate the individual clone which carried the activity, the Source GeneBank
LB/Amp/Meth. As above,

A 20 µL aliquot was removed from each well and assayed with the substrates as above. The
most active clone was used to inoculate cultures of LB/Amp/Meth, which were grown

negative controls. A 600 µM solution of each individual substrate was made up for the purpose
of determining the substrate specificity of the enzyme. Fifty µL of each of
were added individually to the test and control wells and assayed for activity as above. Only the
wells which contained the 4-methylumbelliferyl α-D-galactoside showed an increase in

fluorescence indicating activity. 

While the invention has been described in detail with reference to certain preferred
embodiments thereof, it will be understood that modifications and variations are within the spirit
and scope of that which is described and claimed.